CDS

Accession Number TCMCG017C20854
gbkey CDS
Protein Id OMO82309.1
Location complement(join(103414..103806,103909..103974,104101..104166,104262..104375,104466..105435,105533..105582))
GeneID InterPro:IPR011598
Organism Corchorus olitorius
locus_tag COLO4_23118

Protein

Length 552aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA215141, BioSample:SAMN03160584
db_source AWUE01018155.1
Definition hypothetical protein COLO4_23118 [Corchorus olitorius]
Locus_tag COLO4_23118

EGGNOG-MAPPER Annotation

COG_category K
Description helix loop helix domain
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko03000        [VIEW IN KEGG]
KEGG_ko ko:K12126        [VIEW IN KEGG]
EC -
KEGG_Pathway ko04075        [VIEW IN KEGG]
ko04712        [VIEW IN KEGG]
map04075        [VIEW IN KEGG]
map04712        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGCAGCAGAGGCCCCTGAATCAATCCCGCAGGCTAATTTTGCATCCACGTCTGATATTCCTGAGTTCTCAGAGCTGGTATGGGAAAATGGTCAAATTCTCTTTCGGGGTTTATCGTCCAAGATAACTAATTATAAAAGGAGTACTACTCGTTTCCCAGGTTATAATTTTTCTCAATCTAATTTCAATTTCAAGGATATAGTTCAACGTGAAGGAGAAATGTCTACTGCAAATGATAATAGATCAAAGGATGGGGTTCAAGAATCTACTTTCGGAGATCCCATATCAGGTCTTACGAAGTTAGATCTTAATCATGATAAAAGTAAGATGAATTCTTACCCTCAAGCAAATTACGCGGAACTCTTGTCGGAATTCTATGAAGATGATTTCAGTGTGAAGCAACTAATTGATTCCCATGTTGTTCCTGCTGTAAACAAATTCAACAGTTTTAAACAAAGTGATGATGATGGGTCTCGAAAGCTTGTCGAAGAGATGCCCCATCTTATGAACAGCGATCATGAGGTACCTGATGATCATCATCATCATCACTCTTCTTTGAAGCAATGTGAAGTTTCTGTTCCGTTTATGAGATCAAATTCTGGGGTTGAGGAGAAAAGGGATAGGGTTAACTTTTCCATGTTTTTAAGATCTGGTGCAACAACAAGGCCAAGGAGCGCCCAGCAGGTTCGAGCTTTGGCAGAGACTGGTGATCAGGATATTTTCGAAAGAAATATTGTAAGATCTGAACAAGGAAATTTAATGGGTAATAACACGAAGCCGATATTAATACCAGATACTCATGAGCCTCAAAAAGAGACACTTCCTGATGAGCAGTCTGAAGCAGTCGGCTATAATCAAGATATCTCTCCAAATTCGAGATCTTCTAAGGGAAATATTCCTTGTGATGGAAAATTAGTTGAGCAAATGGTTGGATCATCTTCAGTATGTTCTCGTGGGGCTTCGAATTGTCCAACATACACTTTGAAAACGAGATATGATCAAGACACTGATCTCAGCGAAAATGCGACAGAAGAGCCGGAGGGGACGACGTCGACAAAAGCACCGCCACCACCACGAGGAAGCAAAGGTGCTAAGAAGAAACGAAAAGCAGAAGTTCATAATCTATCTGAAAGGAAGCGAAGAGATAAAATCAACAAGAAGATGCGTGCATTGCAAGAGCTCATTCCCAACTGTAACAAGGTGGACAAAGCTTCGATGCTGGACGAGGCAATTGAGTATTTAAAAACCCTTCAGTTCCAAGTTCAGATGATGTCGATGGGAAGTGGGTTTTTCATGCCACCAAATCCAATGATGTTACATGCGGCAATGCAACAGATGAATGCACAGAATATGATTGGCCCATATTCTCCCATGGGTGTTGGGATGGCCGGTATGGGTATGGGTATGGGCATGGGGTTCCCAAGTCTGCCTGGAATCAGGGAGGCTAGACTCAACAGCATGATTGGGTTCCCCGGGCAGGTGCCATTAATGTCCATGTTGTCGCCTTCACCTTTTACCGCAAGATTCTGCCCGCAATCTGTCCAGGCCCCTGCGCCTGCTATGCAAATGCAAGTGGAACAACAATTTCCAGTTCCAGGTGTTGCTGCTAATGCAATTCCCCTATCCACATCAAAGGATTCAAATACCACATGTCAGTAA
Protein:  
MAAEAPESIPQANFASTSDIPEFSELVWENGQILFRGLSSKITNYKRSTTRFPGYNFSQSNFNFKDIVQREGEMSTANDNRSKDGVQESTFGDPISGLTKLDLNHDKSKMNSYPQANYAELLSEFYEDDFSVKQLIDSHVVPAVNKFNSFKQSDDDGSRKLVEEMPHLMNSDHEVPDDHHHHHSSLKQCEVSVPFMRSNSGVEEKRDRVNFSMFLRSGATTRPRSAQQVRALAETGDQDIFERNIVRSEQGNLMGNNTKPILIPDTHEPQKETLPDEQSEAVGYNQDISPNSRSSKGNIPCDGKLVEQMVGSSSVCSRGASNCPTYTLKTRYDQDTDLSENATEEPEGTTSTKAPPPPRGSKGAKKKRKAEVHNLSERKRRDKINKKMRALQELIPNCNKVDKASMLDEAIEYLKTLQFQVQMMSMGSGFFMPPNPMMLHAAMQQMNAQNMIGPYSPMGVGMAGMGMGMGMGFPSLPGIREARLNSMIGFPGQVPLMSMLSPSPFTARFCPQSVQAPAPAMQMQVEQQFPVPGVAANAIPLSTSKDSNTTCQ